Search results for " Clustering"

showing 10 items of 312 documents

Estimating Missing Information by Cluster Analysis and Normalized Convolution

2018

International audience; Smart city deals with the improvement of their citizens' quality of life. Numerous ad-hoc sensors need to be deployed to know humans' activities as well as the conditions in which these actions take place. Even if these sensors are cheaper and cheaper, their installation and maintenance cost increases rapidly with their number. We propose a methodology to limit the number of sensors to deploy by using a standard clustering technique and the normalized convolution to estimate environmental information whereas sensors are actually missing. In spite of its simplicity, our methodology lets us provide accurate assesses.

010504 meteorology & atmospheric sciencesComputer sciencemedia_common.quotation_subjectReal-time computingEnergy Engineering and Power Technology02 engineering and technologyIterative reconstructionsmart city dealsCluster (spacecraft)01 natural sciencesIndustrial and Manufacturing Engineeringnormalized convolutionstandard clustering technique[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]ConvolutionArtificial IntelligenceSmart city11. Sustainability0202 electrical engineering electronic engineering information engineeringLimit (mathematics)SimplicityCluster analysisInstrumentationad-hoc sensors0105 earth and related environmental sciencesmedia_commonSettore INF/01 - InformaticaRenewable Energy Sustainability and the EnvironmentComputer Science Applications1707 Computer Vision and Pattern Recognitionenvironmental informationmissing informationComputer Networks and CommunicationKernel (image processing)020201 artificial intelligence & image processingcluster analysis2018 IEEE 4th International Forum on Research and Technology for Society and Industry (RTSI)
researchProduct

Hierarchical networks of food exchange in the black garden ant Lasius niger

2020

In most eusocial insects, the division of labour results in relatively few individuals foraging for the entire colony. Thus, the survival of the colony depends on its efficiency in meeting the nutritional needs of all its members. Here, we characterise the network topology of a eusocial insect to understand the role and centrality of each caste in this network during the process of food dissemination. We constructed trophallaxis networks from 34 food-exchange experiments in black garden ants (Lasius niger). We tested the influence of brood and colony size on (i) global indices at the network level (i.e. efficiency, resilience, centralisation and modularity) and (ii) individual values (i.e. …

0106 biological sciences0301 basic medicinesocial network analysisModularity (biology)Foragingself-organisationsocial network analysesEvolution des espèces01 natural sciencesPhysiologie des invertébrésGeneral Biochemistry Genetics and Molecular Biology03 medical and health sciencesSciences du Vivant [q-bio]/Autre [q-bio.OT]Betweenness centralityBlack garden antAnimalsinsectsSocial Behaviorsocial evolutionEcology Evolution Behavior and Systematicsnetwork evolutionPrincipal Component AnalysisBehavior AnimalbiologyAntsEthologieEcologyLasiusFeeding Behaviorbiology.organism_classificationEusociality010602 entomology030104 developmental biologySpace-Time ClusteringInsect Science[SDE]Environmental SciencesCentralityBiologieAgronomy and Crop ScienceSocial Network AnalysisTrophallaxis
researchProduct

Fuzzy quantification of common and rare species in ecological communities (FuzzyQ)

2021

International audience; Most species in ecological communities are rare, whereas only a few are common. This distributional paradox has intrigued ecologists for decades but the interpretation of species abundance distributions remains elusive.We present Fuzzy Quantification of Common and Rare Species in Ecological Communities (FuzzyQ) as an R package. FuzzyQ shifts the focus from the prevailing species-categorization approach to develop a quantitative framework that seeks to place each species along a rarity-commonness gradient. Given a community surveyed over a number of sites, quadrats, or any other convenient sampling unit, FuzzyQ uses a fuzzy clustering algorithm that estimates a probab…

0106 biological sciencesAssembly rulesFuzzy clustering[SDV]Life Sciences [q-bio]Rare species010603 evolutionary biology01 natural sciencesFuzzy logic03 medical and health sciencesEnvironmental monitoringrarityEcology Evolution Behavior and Systematics030304 developmental biologyenvironmental monitoring0303 health sciencesCommunitybusiness.industryEcological ModelingEnvironmental resource managementassembly rulescommonness15. Life on landGeographyfuzzy clustering[SDE.BE]Environmental Sciences/Biodiversity and Ecologybusinessabundance–occupancy distributionscommunity ecology
researchProduct

Quantum clustering in non-spherical data distributions: Finding a suitable number of clusters

2017

Quantum Clustering (QC) provides an alternative approach to clustering algorithms, several of which are based on geometric relationships between data points. Instead, QC makes use of quantum mechanics concepts to find structures (clusters) in data sets by finding the minima of a quantum potential. The starting point of QC is a Parzen estimator with a fixed length scale, which significantly affects the final cluster allocation. This dependence on an adjustable parameter is common to other methods. We propose a framework to find suitable values of the length parameter σ by optimising twin measures of cluster separation and consistency for a given cluster number. This is an extension of the Se…

0301 basic medicineClustering high-dimensional dataMathematical optimizationCognitive NeuroscienceSingle-linkage clusteringCorrelation clustering02 engineering and technologyComputer Science ApplicationsHierarchical clusteringDetermining the number of clusters in a data set03 medical and health sciences030104 developmental biologyArtificial Intelligence0202 electrical engineering electronic engineering information engineeringCluster (physics)020201 artificial intelligence & image processingQACluster analysisAlgorithmk-medians clusteringMathematicsNeurocomputing
researchProduct

SpCLUST: Towards a fast and reliable clustering for potentially divergent biological sequences

2019

International audience; This paper presents SpCLUST, a new C++ package that takes a list of sequences as input, aligns them with MUSCLE, computes their similarity matrix in parallel and then performs the clustering. SpCLUST extends a previously released software by integrating additional scoring matrices which enables it to cover the clustering of amino-acid sequences. The similarity matrix is now computed in parallel according to the master/slave distributed architecture, using MPI. Performance analysis, realized on two real datasets of 100 nucleotide sequences and 1049 amino-acids ones, show that the resulting library substantially outperforms the original Python package. The proposed pac…

0301 basic medicineComputer science[INFO.INFO-SE] Computer Science [cs]/Software Engineering [cs.SE]Health Informatics[INFO.INFO-SE]Computer Science [cs]/Software Engineering [cs.SE][INFO.INFO-IU]Computer Science [cs]/Ubiquitous Computing03 medical and health sciences[INFO.INFO-CR]Computer Science [cs]/Cryptography and Security [cs.CR]0302 clinical medicineSoftware[INFO.INFO-ET] Computer Science [cs]/Emerging Technologies [cs.ET][INFO.INFO-DC] Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]Cluster AnalysisHumansCluster analysis[INFO.INFO-CR] Computer Science [cs]/Cryptography and Security [cs.CR]computer.programming_languagebusiness.industry[INFO.INFO-IU] Computer Science [cs]/Ubiquitous ComputingSimilarity matrixPattern recognitionDNAGenomicsSequence Analysis DNAPython (programming language)Mixture model[INFO.INFO-MO]Computer Science [cs]/Modeling and SimulationSpectral clusteringComputer Science Applications030104 developmental biologyComputingMethodologies_PATTERNRECOGNITION[INFO.INFO-MA]Computer Science [cs]/Multiagent Systems [cs.MA][INFO.INFO-ET]Computer Science [cs]/Emerging Technologies [cs.ET][INFO.INFO-MA] Computer Science [cs]/Multiagent Systems [cs.MA][INFO.INFO-MO] Computer Science [cs]/Modeling and SimulationArtificial intelligence[INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]businesscomputerAlgorithmsSoftware030217 neurology & neurosurgery
researchProduct

Retrospective Proteomic Screening of 100 Breast Cancer Tissues.

2017

The present investigation has been conducted on one hundred tissue fragments of breast cancer, collected and immediately cryopreserved following the surgical resection. The specimens were selected from patients with invasive ductal carcinoma of the breast, the most frequent and potentially aggressive type of mammary cancer, with the objective to increase the knowledge of breast cancer molecular markers potentially useful for clinical applications. The proteomic screening; by 2D-IPG and mass spectrometry; allowed us to identify two main classes of protein clusters: proteins expressed ubiquitously at high levels in all patients; and proteins expressed sporadically among the same patients. Wit…

0301 basic medicineGene isoformClinical Biochemistrygel-based proteomiclcsh:QR1-502Motilitysurgical tissuegel-based proteomicsBiologyBioinformaticsProteomicsBiochemistrylcsh:MicrobiologyArticleMetastasis03 medical and health sciencesBreast cancerbreast cancerStructural BiologyMedicineSettore BIO/06 - Anatomia Comparata E CitologiaMolecular Biologyoncology_oncogenicsmass spectrometrysurgical tissuesbusiness.industryCancermedicine.diseasePrimary tumor030104 developmental biologyApoptosisprotein clusteringCancer researchbreast cancer; surgical tissues; gel-based proteomics; mass spectrometry; protein clusteringbusinessProteomes
researchProduct

FragClust and TestClust, two informatics tools for chemical structure hierarchical clustering analysis applied to lipidomics. The example of Alzheime…

2016

Lipidomic analysis is able to measure simultaneously thousands of compounds belonging to a few lipid classes. In each lipid class, compounds differ only by the acyl radical, ranging between C10:0 (capric acid) and C24:0 (lignoceric acid). Although some metabolites have a peculiar pathological role, more often compounds belonging to a single lipid class exert the same biological effect. Here, we present a lipidomics workflow that extracts the tandem mass spectrometry data from individual files and uses them to group compounds into structurally homogeneous clusters by chemical structure hierarchical clustering analysis (CHCA). The case-to-control peak area ratios of the metabolites are then a…

0301 basic medicineHigh-resolution mass spectrometrySettore MED/09 - Medicina InternaChemical structureComputational biologyPlasma biomarkers01 natural sciencesTriglycerideBiochemistryHomogeneous clustersAnalytical ChemistryCeramide03 medical and health sciencesAlzheimer DiseaseTandem Mass SpectrometryHealth informatics toolsLipidomicsHumansStatistical analysisData miningChromatography High Pressure LiquidAgedAged 80 and overMolecular StructureChemistry010401 analytical chemistryLipids0104 chemical sciencesHierarchical clusteringPhospholipid030104 developmental biologyWorkflowBiochemistryCase-Control StudiesSettore MED/26 - Neurologia
researchProduct

A clustering package for nucleotide sequences using Laplacian Eigenmaps and Gaussian Mixture Model.

2018

International audience; In this article, a new Python package for nucleotide sequences clustering is proposed. This package, freely available on-line, implements a Laplacian eigenmap embedding and a Gaussian Mixture Model for DNA clustering. It takes nucleotide sequences as input, and produces the optimal number of clusters along with a relevant visualization. Despite the fact that we did not optimise the computational speed, our method still performs reasonably well in practice. Our focus was mainly on data analytics and accuracy and as a result, our approach outperforms the state of the art, even in the case of divergent sequences. Furthermore, an a priori knowledge on the number of clust…

0301 basic medicineNematoda01 natural sciencesGaussian Mixture Model[STAT.ML]Statistics [stat]/Machine Learning [stat.ML][MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]ComputingMilieux_MISCELLANEOUScomputer.programming_language[STAT.AP]Statistics [stat]/Applications [stat.AP]Phylogenetic treeDNA ClusteringGenomicsHelminth ProteinsComputer Science Applications[STAT]Statistics [stat]010201 computation theory & mathematics[INFO.INFO-MA]Computer Science [cs]/Multiagent Systems [cs.MA]Data analysisEmbeddingA priori and a posteriori[INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]Health Informatics0102 computer and information sciences[INFO.INFO-SE]Computer Science [cs]/Software Engineering [cs.SE]Biology[INFO.INFO-IU]Computer Science [cs]/Ubiquitous Computing03 medical and health sciences[INFO.INFO-CR]Computer Science [cs]/Cryptography and Security [cs.CR]Laplacian EigenmapsAnimalsCluster analysis[SDV.GEN]Life Sciences [q-bio]/GeneticsModels Geneticbusiness.industryPattern recognitionNADH DehydrogenaseSequence Analysis DNAPython (programming language)Mixture model[INFO.INFO-MO]Computer Science [cs]/Modeling and SimulationVisualization030104 developmental biologyComputingMethodologies_PATTERNRECOGNITIONPlatyhelminths[INFO.INFO-ET]Computer Science [cs]/Emerging Technologies [cs.ET]Programming LanguagesArtificial intelligence[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]businesscomputerComputers in biology and medicine
researchProduct

Autoimmune polyglandular diseases.

2019

Autoimmune polyglandular diseases (APD) are defined as the presence of two autoimmune -induced endocrine failures. With respect to the significant morbidity and potential mortality of APD, the diagnostic objective is to detect APD at an early stage, with the advantage of less frequent complications, effective therapy and better prognosis. This requires that patients at risk be regularly screened for subclinical endocrinopathies prior to clinical manifestation. Regarding the time interval between manifestation of first and further endocrinopathies, regular and long-term follow-up is warranted. Quality of life and psychosocial status are poor in APD patients and involved relatives. Familial c…

0301 basic medicinePediatricsmedicine.medical_specialtyEndocrinology Diabetes and Metabolism030209 endocrinology & metabolismFamilial clusteringClinical manifestationEndocrine System DiseasesAutoimmune Diseases03 medical and health sciences0302 clinical medicineEndocrinologyQuality of lifeMedicineHumansIn patientStage (cooking)Polyendocrinopathies AutoimmuneSubclinical infectionPatient Care Teambusiness.industrymusculoskeletal neural and ocular physiologyIncidence030104 developmental biologycardiovascular systemQuality of LifeInterdisciplinary CommunicationHigh incidenceMorbiditybusinessPsychosocialcirculatory and respiratory physiologyBest practiceresearch. Clinical endocrinologymetabolism
researchProduct

Innovative Strategies to Develop Chemical Categories Using a Combination of Structural and Toxicological Properties.

2016

Interest is increasing in the development of non-animal methods for toxicological evaluations. These methods are however, particularly challenging for complex toxicological endpoints such as repeated dose toxicity. European Legislation, e.g., the European Union's Cosmetic Directive and REACH, demands the use of alternative methods. Frameworks, such as the Read-across Assessment Framework or the Adverse Outcome Pathway Knowledge Base, support the development of these methods. The aim of the project presented in this publication was to develop substance categories for a read-across with complex endpoints of toxicity based on existing databases. The basic conceptual approach was to combine str…

0301 basic medicineQuantitative structure–activity relationshipread acrossPredictive Clustering Tree (PCT) methodComputer science610010501 environmental sciencescomputer.software_genre600 Technik Medizin angewandte Wissenschaften::610 Medizin und Gesundheit01 natural sciences03 medical and health sciencesPharmacology (medical)Cluster analysis0105 earth and related environmental sciencesOriginal ResearchAlternative methodsPharmacologytoxicological and structural similaritybusiness.industryQSARlcsh:RM1-950non-animal methods; QSAR; readacross; Predictive Clustering Tree (PCT) method; toxicological and structural similarityIdentification (information)Tree (data structure)030104 developmental biologyConceptual approachlcsh:Therapeutics. PharmacologyKnowledge basenon-animal methodsData miningWeb servicebusinesscomputerFrontiers in pharmacology
researchProduct